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Secure Cascade Channel Synthesis 

Sanket Satpathy and Paul Cuff 


Abstract 

We consider the problem of generating correlated random variables in a distributed fashion, where communication is 
constrained to a cascade network. The first node in the cascade observes an i.i.d. sequence X" locally before initiating 
communication along the cascade. All nodes share bits of common randomness that are independent of X". We consider secure 
synthesis - random variables produced by the system appear to be appropriately correlated and i.i.d. even to an eavesdropper who is 
cognizant of the communication transmissions. We characterize the optimal tradeoff between the amount of common randomness 
used and the required rates of communication. We find that not only does common randomness help, its usage exceeds the 
communication rate requirements. The most efficient scheme is based on a superposition codebook, with the first node selecting 
messages for all downstream nodes. We also provide a fleeting view of related problems, demonstrating how the optimal rate 
region may shrink or expand. 

I. Introduction 

This paper studies the synthesis of correlated random variables under a total variation constraint, known as strong coordination 
121 or channel synthesis 13. Given an i.i.d. input to the system, we would like to generate a correlated output at a remote 
location - this can be understood as approximation of a conditional probability distribution. Due to the stochastic nature of the 
objective, the cooperating parties benefit from access to common randomness, in addition to their communication capabilities. 

The optimal tradeoff between communication and common randomness was derived by Cuff im and Bennett et al. ||5| for the 
case of two random variables and the results have been extended in other work ||6l, Q, El, m, El- One particularly pleasing 
aspect of the above tradeoff was that it recovered two familiar measures of correlation as the required rate of communication 
— mutual information and Wyner’s common information na — in the presence of abundant and no common randomness, 
respectively. 

Requiring that the synthesized joint distribution be close to the desired joint distribution in total variation is a more stringent 
constraint than empirical coordination O, El i-e. jointly typical input and output sequences. On the other hand, we enjoy the 
benefit of the synthesized sequences being immune to statistical tests designed to detect i.i.d. correlated sequences El- This is 
a simple consequence of the fact that hypothesis tests will produce identically distributed outcomes for distributions that are 
extremely close in total variation. Also, total variation can be bounded by entropy ina Theorem 17.3.3], which is extremely 
useful for proving converse results. 

The above observations have led to applications in secrecy and game theory a, ini- Schieler and Cuff lfT4l study secrecy 
with causal disclosure of information to the eavesdropper - their achievability scheme hinges on a distributional approximation 
result. Winter ca and Chitambar et al. m consider secure generation of correlated random variables along the lines of the 
channel synthesis problem El, with the notable difference that no information sequence is provided as an external input. We 
refer the interested reader to El, im for a more thorough discussion of the properties of total variation and comparison with 
other metrics. 


ni?o bits 



Fig. 1. The i.i.d. sequence X'^ is given by nature. Messages Mi and M 2 are sent along the cascade at rates Ri and R 2 . Common randomness K is shared 
by all 3 nodes at rate Rq. We want {X'^, Z'^) i.i.d. correlated and independent of the messages (Mi, M 2 ). 
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This work extends 13 to a network setting. Such extensions have been previously considered by 0, a albeit without tight 
results. The cascade channel synthesis problem shown in Fig. provides a starting point for the theory of strong coordination 
over general networks. Distributed networks for control and sensing are seemingly ubiquitous - the power grid, road networks, 
server farms and the internet are a few prominent examples ifTSll . In many control settings, one would like the actions at 
various nodes to be independent of the communication so the actions cannot be anticipated by malicious eavesdroppers. We 
consider the problem of synthesizing sequences that appear to be i.i.d. and appropriately correlated even from the perspective 
of an onlooker who can see messages sent over the communication channel but does not have access to the source of common 
randomness. 

The cascade structure for communication is especially relevant for large distributed networks where there is a cost per unit 
distance associated with sending messages. In such a setting, it would be more economical for nodes to forward appropriate 
messages locally in a cascade fashion rather than having a central node talk to all the other nodes. Also in some settings, there 
might be a hierarchy among the nodes that inherits the cascade structure (for example, in parallel computation, interaction takes 
place in a master-slave hierarchy). In section II-E, we characterize the optimal rate region for the task assignment problem, 
where 3 out of m > 3 tasks have to be distributed uniformly at random to the 3 nodes, with the first node’s tasks chosen at 
random by nature. In section III-A, this example is used to demonstrate how the rate region is sensitive to the particulars of 
the secrecy constraint. 

In the context of source coding, the cascade network has received some prior attention. Yamamoto ED considered the lossy 
transmission of a source pair {X,Y) to receivers along a cascade, with X and Y intended for separate receivers. Vasudevan, 
Tian and Diggavi GOil consider the case when X = Y i.e. both receivers desire the same data, and they have additional 
side-information. Permuter and Weissman ED also explore the role of side-information at the initial nodes and provide an 
explicit solution for Gaussian random variables. Cuff, Su and El Gamal ll22l study a variant where X and Y are provided at 
the first and second nodes of the cascade, while the receiving node seeks a function f{X,Y). 

All of the above works can be broadly categorized as problems of empirical coordination ||2] Theorem 11]. In contrast, we 
study strong coordination |2l in the cascade network. Cuff, Permuter and Cover ||2l Theorem 5] solve the analogous problem for 
empirical coordination on arbitrarily long cascade networks. In the same paper, they make a conjecture about the relationship 
between optimal schemes for empirical coordination and strong coordination. In section III-B, we provide an example that 
refutes the most general version of their conjecture. 

In this work, we only consider random variables with finite alphabets and assume that every node in the cascade has 
sufficient local randomness. Interested readers may refer to El for a treatment of synthesis with limited local randomness. 
Bloch and Kliewer Il23l studied the cascade synthesis problem without secrecy constraints. They provide inner and outer bounds 
that do not match in general. In fact, we find that the secrecy constraint makes a complete characterization of the rate region 
tractable. Vellambi et al. ||24l, ESl also study the tradeoff w.r.t. local randomness at the nodes without secrecy constraints. They 
provide a general achievability scheme that is proved to be optimal in certain regimes of common randomness, with additional 
assumptions on the encoder, or when the distribution to be synthesized enjoys a Markov chain structure. The functional regime 
considered in coincides with optimal operation in the secure cascade synthesis problem. Einally, Il26l uses ideas from the 
present work to devise secure source coding schemes for a cascade network. 

II. Main Result 

A. Notation 

We represent both random variables and probability distribution functions with capital letters, but only letters P and Q 
are used for the latter. The conditional distribution of the random variable Y given the random variable X is denoted by 
Py\x{v\x)- We may abbreviate this as Py\x- We also abbreviate the product distribution Hti Px\Y{xi\yi) Y\Px\y when 
the y" sequence is clear from context. We use the script letter X B x to denote the alphabet of X. Sequences Xi,..., Xn are 
denoted by Xp, with A" := AT". The set {1,.. ., m} is denoted by [m], 

Markov chains are denoted by X — Y — Z implying the factorization Pxyz = PxyPz\y- The factorization Pxy = PxPy, 
i.e. independence, is denoted by A _L Y. We define the total variation distance as 

\\Px-Qx\\^^ ^ (1) 

X 

The convex hull of a set A is denoted by Conv(A). The empirical distribution (normalized counts) of a vector x” is denoted 
by e where 

Pa;" (y) = - ^ l{a;i=y} 


(2) 


for y G X, and denotes the fc-dimensional simplex. Wyner’s common information iflOl between random variables X and 
Y is denoted by C{X;Y), and defined as 

C{X-,Y)= min I(X,Y;U). (3) 

U-.X-U-Y 

B. Problem-Specific Definitions 

Although we solve the synthesis problem for arbitrarily long cascades, we restrict the presentation here to a cascade of three 
nodes for simplicity. We state our most general result in section 11-F. 

We have the i.i.d. source A" ~ Qx and we would like to synthesize the channel Y[Qyz\x- Messages sent along the 
cascade communication links are denoted by Mi G [2"^^] and M 2 G [2”^^]. The common randomness shared by all nodes 
K is uniformly distributed on [2"^°] and independent of X'^. 

Definition 1. A ( 2 "«o, 2 ’^^^, 2 "^^ , n) secure cascade channel synthesis (SCCS) code consists of randomized encoding functions 

: A” X [2”-^“] ^ [2”-^i], 

Q(enc) . pnitij ^ prartoj |■ 2 "^ 2 J 

and randomized decoding functions 

Qidec) . X [2”-^^] X ^ 3^", 

: [2^^^] X [2”'^«] ^ Z". 


The randomization used for each of these functions is assumed to be independent. In other words, a randomized function can 
be thought of as a function with an additional argument that is a random variable (suppressed in the notation). Each function 
uses a random variable which is independent of everything else in the problem setting. We have Mi — Fn(X'^, K), M 2 = 
G^r"\Mi,K), Y^ = G^n""\Mi,M 2 ,K) and = iF„(M 2 ,iT). 

Note that node 2 has both encoding and decoding capability. The induced joint distribution of a (2"^“, 2"^^, 2"«=,n) SCCS 
code is the joint distribution Px^,Y'^,Z",k,Mi,M 2 above specifications. 

Definition 2. A sequence of (2”^“, 2”^% 2"^^, n) SCCS codes for n > 1 is said to achieve Qyz\x if the induced joint 
distributions have marginals that satisfy 


lim 

n—Yoo 


n 

Px^Y^Z^MxM2 ~ PmiM2 Qxyz 

t=l 


= 0 . 

TV 


(4) 


Definition 3. A rate triple {Rq, Ri, R 2 ) is said to be achievable if there exists a sequence of (2”^°, 2"^^, 2”^^, n) SCCS 
codes that achieves Qyz\x- 

Definition 4. The secure synthesis rate region C is the closure of achievable triples (i?o, i?i, i? 2 )- 


C. Main Result 


The characterization of the set of achievable rate triples is given in terms of the following set 


where 


(i?0, i?i, i?2) c 


Sd = 


D = 


Px,Y,Z,U,V 


]R3 


^Px,Y,z,u,v e D s.t 

i?l 

> 

I{x- U, V) 

i?2 

> 

IiX;V) 

i?o 

> 

I{X,Y,Z-U,V) 



Y, Z) ~ QxQyz\x, 


X-{U,V)-Y, 
{X,Y,U)-V-Z, 
|v|<|A||3;||z| + 3, 
IZ^I^IA-IIJ^II^IIVI+S 


(5) 


(6) 


Also, let D' = Do {Px.y,z,u,v '■ H{^\U) = 0} denote the restriction of D to joint distributions where E is a function of U. 
Theorem 1. 

C = Sd=Sd'. (7) 


The achievability proof is based on superposition coding - the first node picks messages for all links and downstream nodes 
merely forward the appropriate messages. The details are presented in section IV. 










D. Remarks 


A startling feature of the optimal encoding scheme, which is implied by the optimal region above, is that there is no loss 
of generality in assuming that the second message is a function of the first i.e. the local randomness used in synthesizing 
is not essential to correlating with However, it is precisely this feature that obstructs solution to the cascade 

channel synthesis problem with no eavesdropper ll23l . In the absence of secrecy constraints, the system may benefit when 
intermediate nodes generate messages locally - easily seen when X _L (Y, Z) i.e. the first node is passive. 

For the above case, our result reduces to the region i?o > C{Y ; Z), obtained by picking [/ = 0 and V Y X, with Y—V—Z 
to minimize I{V]Y,Z). Communication is not necessary, or useful in any way, even between the active nodes! In the non- 
secure synthesis problem, the optimal rate region is given hy R 2 + Rq > C{Y; Z). We see that communication (originating at 
the second node) is still not necessary but can be useful. Finally, consider a modified secure synthesis problem, where the Y"' 
sequence is provided by nature (with X still independent of (Y, Z)). In this case, the optimal rate region 13 III.C] is described 
by the constraints 

R2>I{Y;V), (8) 

Ro>I{Y,Z;V), (9) 


with Y — V — Z. Communication is necessitated by external inputs to the system. 

In the regime of abundant common randomness, the communication rate requirements of our result coincide with the rate 
region for empirical coordination in the cascade channel 121 Theorem 5] since there must exist a realization of the shared 
randomness that yields good empirical coordination codes, in agreement with 121 Theorem 2]. This observation consolidates 
the intuition that the onus of secrecy that we have taken on is borne primarily by the available common randomness. However, 
note that the rate of common randomness can be much larger than the largest communication rate. This shows again that while 
common randomness is helpful for secrecy, it also serves to coordinate the actions of the nodes. 

By the data-processing inequality ini, the choice {U,V) = {Y, Z) simultaneously minimizes both the communication rates 
at Ri > I(X;Y, Z) and i ?2 > I{X] Z). Also, the minimum achievable rate of common randomness is 


C,{X-Y-Z) 


min 


(u,vy. 


X-(U,V)-Y, 

(X,Y,U)-V-Z 


I[X,Y,Z-U,V), 


( 10 ) 


which can be viewed as a generalization of Wyner’s common information in the cascade setting. Another straightforward 
generalization of Wyner’s common information that has been considered in the literature EtII is 

C{X]Y]Z) =m.inI{X,Y,Z-U), (11) 


where the minimum is over random variables U such that given U, X, Y and Z are independent of each other. In fact, the two 
quantities are the same (without regard to constraints on the alphabet sizes). This is because the minimizers of ([TT]l and ( [T0| l 
are compatible with each other’s Markov structures. For example, if U attains the minimum for ( fTTl ), then ([/, V) = (0, U) 
satisfies the Markov chains in Note that the communication and common randomness rates cannot be simultaneously 
minimized in general. 

E. Task Assignment Example 

We now compute our region for an example we will call task assignment, where 3 out of m > 3 tasks are to be assigned to 
the 3 nodes uniformly at random. To be precise, we consider a channel Qyz\x that acts on X uniformly distributed on [m] 
i.e. Qx = ra~^ and produces a pair Y ^ Z uniformly distributed over all distinct pairs in [m] \ {2f}. This is a generalization 
of the scatter channel example in ||3l. 

We shall now explain how the Markov chains in Q simplify the computation of the rate region by letting us parametrize 
{U, V). As per ([^l, we consider joint distributions Px,y,z,u,v € D. The Markov chains ensure that the conditional distribution 
factorizes as Px,y,x\u,v = Px\u,vPy\u,vPz\v- Also, these distributions have the constraint that the supports of X,Y and Z 
cannot intersect. 

By the above Markov constraints, for any v G V, the conditional distribution Px,y,z\v=v must have support(Z) disjoint 
from support(X) U support(y). Let o„ = to — |support(Z)| > |support(X) U support(y)| for this conditional distribution. If 
we further condition on U = u, for any u GU, then Px,y,z\u=u,v=v must have all three supports of X, Y and Z disjoint. Let 
bu,v = |support(y)| < ay. This leaves |support(Ar)| < From these observations alone, we can bound the entropies 

and thus the rate region. We have the parametric bounds 

H{X\V = v) < lOgtty 
H(XIU = u,V = v) < log(a„ - by^y) 

H{X,Y, Z\U = v,V = v) < log(a„ - by^y)by^y{m - ay), 


( 12 ) 

(13) 

(14) 


which do not depend on the marginal distribution Pjjv- The above inequalities give lower bounds on the required rates. We 
can achieve these rates by letting U and V be uniformly distributed over all permissible subsets for all choices of ay,bu,v 
Thus, the rate region is given by the convex hull of the set 


i?Q e : 3a S [m — 1] \ {!}, 6 G [a — 1] s.t. 

> log(iS) 

> log(^) 

^0 > log 


(15) 


The communication rates are minimized when a = m — 1 and 6=1 i.e. given ({ 7 , V), the uncertainty is concentrated on 
X. On the other hand, the common randomness requirement is minimized when b ~ ^ and a « up to the nearest integer 
i.e. given {U,V), the uncertainty is shared equally by X,Y and Z. The tradeoff between information content of the messages 
and rate of common randomness is evident here. A plot of the optimal rate region is provided in Fig. 



Fig. 2. A slice of the rate region (B) for m = too tasks. For schemes near the optimal frontier, the common randomness rate Rg far exceeds the 
communication rates. Due to the structure of the optimal scheme in Theorem 1, the dominant tradeoff is between Ri and Rg. In general, the optimal rate 
surface is polyhedral with up to iK™—2) ygj-jjces 


E Arbitrarily Long Cascades 

Our main result of Theorem [T] can be readily extended to secure channel synthesis of a distribution QYi,...,Ym-i\x for ^ 
cascade with m > 3 nodes, witn communication rates i?i,... ,Rm-i on the links of the cascade and common randomness 
shared by all nodes at rate Rq. 

The optimal rate region is 

G K’" : 3P^ G Dm s.t. V 1 < * < TO - 1, 

R, > I{X-ur-^) 

Ro > 


( 16 ) 































where Dm is the set of distributions 


x,Y,"'-\ur 


ivr 




— 1, 1 < j < m — 2, 

{X,Yr-^) QxQ, 

ix,Y^,ui)-u;i-^ 

H{Ur-^\U,)=0, 


- Yj+i, 


(17) 


with the cardinality bounds 


m<\x\i n 


. fc=l 



+ m + i — 2, 


(18) 


for 1 < z < m — 1. The proof is similar to the proof of Theorem 1, which is presented in sections IV and V. The appendix 
contains a proof outline for the general case. 

III. Variations on Cascade Channel Synthesis 

General solutions for modifications of the problem considered above remain elusive. However, we attempt to derive some 
qualitative insights by considering two simple variations in this section. 

A. No Secrecy on Link 1 

Recall that the channel synthesis problem for the cascade setting with no secrecy constraints ll23l is unsolved. We now 
demonstrate how the rate region expands when the secrecy constraint is relaxed. Consider the network of Fig. [T] with the first 
node’s message unseen by the eavesdropper. In this case, we modify our definition of achievability Q to a sequence of codes 
that satisfy 


lim 

n—^oo 


Px’^y^Z’'M2 ~ Pm2 


Qxyz 


= 0 . 


(19) 


Due to the relaxed secrecy criterion, some of the burden carried by the common randomness can be distributed to communication 
on the first link. In fact, it may be beneficial to discard the superposition structure and have the second node perform local 
actions to generate its message. 


Consider the rate regions 


<^(m) A 

Od - < 


and 


Ri 

i?2 

i?o 

Ri + Rq 




■ ^Px,Y,Z,U,V G D S.t. 

> HX-,U,V) 

> I{X-V) 

> I{X,Y,Z-V) 

> I{X,Y,Z-U,V)+I{X-V) 


( 20 ) 


5 


(out) 

D 


= < 


Ri 




^Px,Y,z,u,v G D s.t. 

Ri 

> 

I{X;U,V) 

i?2 

> 

I{X;V) 

i?0 

> 

I{X,Y,Z;V) 

i?0 

> 

I{X,Y,Z;U,V) 




( 21 ) 


where D is defined in (|^. 

Theorem 2. Let Cioci denote the closure of the set of achievable rates under the local secrecy criterion ([T9]l. We have 


^ L/ocl Y 


(out) 


( 22 ) 


As shown in the appendix, the result is tight for any distribution which can be efficiently synthesized under the constraint 
I{Y;U\V,X) = 0. The task assignment example, where the number of tasks equals the number of nodes in the cascade 
(m = 3), is an example where the above bounds are tight. The rate region is given by all rate tuples in 


Ri 

> 

log 3 '1 


i?2 

> 

log 3 -1 y 

(23) 

i?0 

> 

log 3 J 



which yields a 1 bit discount in the rate of common randomness when compared to 

Ri > log 3 
i ?2 > log 3 - 1 

i?o > log 3 + 1 


(24) 


















the rate region given by ( [T5] l. Note that the projection of the region onto the communication rates (i?i,i? 2 ) is invariant (cf. 
II 23 I Corollary 1] and ^ Theorem 2]). Proofs are presented in the appendix. 


B. No Secrecy on Link 2 


For completeness, we also consider the case where only the transmission on the first link is to be secured. Note that the 
optimal scheme for the fully secure cascade synthesis problem reduces to concealing the transmission on the first link. However, 
we show here that the solution for this relaxed problem remains elusive. The gap is the functional relation between auxiliary 
random variables, that was established in the fully secure problem. 

Consider the network of Fig. [T] with the second node’s message unseen by the eavesdropper. We modify our definition of 
achievability 0 to a sequence of codes that satisfy 


lim 

n—>-oo 


Pxr 




-P. 


Ml 


Qxyz 


= 0. 

TV 


(25) 


Again, it may be beneficial to discard the superposition structure and have the second node perform local actions to generate 
its message. However, we present the achievable region used in the fully secure scheme to illustrate the gap that arises in the 
analysis. 

Consider the rate region 


(i?0, ??1 j R 2 ) G 


^Px.y,z,u,v G D s.t. 

Ri 

> 

I{X-U) 

R 2 

> 

I(X-V) 

Rq 

> 

I{X,Y,Z-,U) 


(26) 


where D is defined in (|^. Note that is the region Sw in Theorem 1. 

Theorem 3. Let Cioc 2 denote the closure of the set of achievable rates under the local secrecy criterion ( |25] l. We have 

QCioc2QSf‘'\ (27) 


Recall that D' = D D {Px,y,z,u,v '■ H{V\U) = 0}, so the functional relation established in Theorem 1 is really particular 
to the fully secure cascade synthesis problem. Proofs are presented in the appendix. 


C. Cascade with Relay 

Provision of a centralized source of common randomness seems like an optimistic assumption for many practical scenarios. 
A more realistic assumption is common randomness shared between adjacent nodes of a communication network. Consider 
the cascade network in Fig. endowed with this structure. 



Fig. 3. The i.i.d. sequence X'^ is given by nature. Messages Mi and M 2 are sent along the cascade at rates Ri and R 2 . Common randomness Ki and 
K 2 are shared by adjacent nodes at rates and We want Z'^) i.i.d. correlated and independent of the messages (Mi, M 2 ). 


For simplicity, we set F — % so that the intermediate node of the cascade plays the passive role of a relay. Furthermore, we 
shall assume that we have an abundant supply of common randomness on both links. This renders any secrecy requirement 
superfluous. We wish to produce a sequence F" at the final node of the cascade such that 


lim 

n—>-oo 


P 


X"Z" 



(28) 


which is a simplification of Q. The question we seek to address is: is there a communication-cost of replacing a centralized 
source of randomness with localized common randomness? 

Conjecture 1 of ||2l raises the question of whether optimal strategies for empirical coordination can be converted into optimal 
strategies for strong coordination, simply by augmenting the scheme with sufficient common randomness. The following result 




















provides a negative answer if the common randomness is not shared among all nodes. Note that the conjecture remains an 
intriguing open question for networks with a single, centralized source of common randomness. 


Theorem 4. Let Creiay denote the closure of the set of achievable communication rates under the coordination criterion ( |28] l. 
We have 


where 


and 


5, 


relay 


D. 


relay — 


Creiay ' 

~ Rrelay ; 

(29) 

(i?l, R2) G 

: ^Px,Z,U G Drelay S.t. '| 


Ri 

> I{X-U) 

(30) 

R2 

> I{Z- U) ] 


( Px,Z,U ■ 

{X, Z) ~ QxQz\x, I 


- 

x-u-z, y 

(31) 

1 

< |A| + |Z|+2 J 



The proofs for this section are provided in the appendix. 

Empirical coordination on the cascade network, i.e. synthesis of Z” that is jointly typical with X”, was studied in El 
Theorem 5]. The optimal empirical coordination scheme achieves = I{X]Z). We illustrate the difference 

with the above region by example. 

Consider the scatter channel El H-H], where {X, Z) is drawn uniformly from all pairs (X, Z) G [m]^ with X ^ Z and 
TO > 2. By arguments similar to the evaluation of (|^ for the task assignment example in section II-E, ( |30| simplihes to 

/ { G : 3a G [to — 1] s.t. \ \ 

Creiay = ConV J > log ( f) I ( 32 ) 

U * ^ “>4=) ]) 

The optimal empirical coordination scheme achieves 

R(e-mp) ^ ^ ^ ^33^ 

TO — 1 


On the other hand, setting a = to — 1 to achieve Ri — log implies that i ?2 = log to. The difference i ?2 — = 

log (to — 1) is unbounded, as a function of to. On the other hand, the optimal sum-rate in ( [30l l is about 2 bits (exact for even 
to). 


IV. Achievability Proof of Theorem 1 

A. Total Variation Properties 

Before delving into the achievability proof, we list some useful properties of the total variation distance: 

• Total variation between marginals is upper-bounded by the total variation between joint distributions El Lemma V. 1], i.e. 

\\Px - QxWtv ^ \\Px,Y - Qx,y\\tv- 

• Total variation between joint distributions reduces to total variation between marginals if conditional distributions given 
the marginal variables coincide El Lemma V.2], i.e. 

\\PxPy\X - QxPY\x\\r.v = - QxWtv' 

• The expected value of a bounded function is continuous w.r.t. total variation i.e. 

\EpfiX) - Eq/(X)| < 2fma.\\P - QWtv^ (36) 

where fmax = \fix)\. 



B. Soft-Covering 


Our random number generation scheme is hinged on a distributional approximation result, which we refer to as Wyner’s 
soft-covering lemma Q. It is implied by results on resolvability ll28l and goes back to Wyner’s work on common information 
Ho). For literature about related extensions, readers may refer to 0. 

Here is the most basic setting that the lemma addresses. Given a memoryless channel Qx\u^ we want to synthesize X" ~ 
Qx at the output. However, we would like to do it using an input selected randomly from a small codebook of C/" ^ Y\Qu 
codewords. How large does the codebook need to be? The lemma provides a sufficient condition (that is also necessary ESI) 
in order to avoid a biased X" sequence. 


Lemma 1 (Lemma IV.l in 0). For any discrete distribution Qux, let = {U^ be a codebook of sequences each 
drawn independently from Qu. For a fixed codebook, define 

^ 2"^ n 

^ E n Qx\u{xk\Uk{m)), (37) 

m—1 k—1 

the distribution induced by passing a random C/" codeword through the memoryless channel Qx\u- Then, R > I{X;U) 
implies that 


E 


Px 


Qx{xk) 


5 


(38) 




TV 


with lim„_>oo £« = 0, where the expectation is w.r.t. 

Since our synthesis scheme is based on a superposition code, we will need a generalized version of Lemma provided by 
the “generalization of Lemma 6.1 of [5]” in 161. A closely related result is the superposition soft-covering lemma |[3 Corollary 
VII. 8]. 



Fig. 4. Generalized Soft-Covering Lemma: Codewords ({/", V") drawn randomly from a superposition codebook are passed through the memoryless channel 
Qx \ uv - 


Lemma 2 (Generalized Soft-Covering ©)■ For any discrete distribution Qxuv, let be a codebook of 

sequences each drawn independently from Y[Qv- For each C B'yl, let mb)}^ be a codebook 

of sequences each drawn independently from Y\Qu\v- For fixed codebooks, define 

^ 2"^l> n 

= 2n(R^+R,) E ^ \\Qx\vu{xk\Vk{rnb),Uk{ma,rrib)), (39) 

ma = l rrib — l k—1 

the conditional distribution induced by passing random ({7",L") codewords through the memotyless channel Qx\uV’ 
shown in Fig. Then, 

Rb>I{X-V), (40) 

Ra+Rb> I{X-U,V), (41) 


imply that 


E 


Px 


Qx{xk) 

fc=l 


< s„ 


(42) 


with lim„_>,oo Sn = 0, where the expectation is w.r.t. Byl and 

(n) 

Note that the lemma places a stricter constraint on the base layer By^ of the superposition code. For example, if the V" 
codebook is too small, then we will have a biased sample of channel instances YlQx\u,v=vk’ which will hinder synthesis of 
X" even if B^}^y„ is extremely large. The lemma provides a sufficient condition to avoid this bias. 






















C. Construction of Idealized Distribution 

Assume that {Rq, Ri, R 2 ) is in the interior of Sd- Then there exists a distribution Qxyzuv S D such that the rates in (|^ 
are strictly satisfied. We now describe an idealized distribution T, from which we shall derive our encoders. This idealized 
distribution acts as a simple proxy for the actual coding scheme. 

For n > 1, let {K, Ma, Mb) be uniformly distributed on [2""^“] x ^ We shall set Mi := (Ma, Mb) 

and M 2 := Mb. Consider a codebook of 2 "(^o+^ 2 ) yn sequences (randomly drawn according to the i.i.d. distribution 
n Qv), where the codewords are indexed as v'^{Mb, K). For every 1^” codeword, we have a codebook Bun^yn of 
[/" sequences (randomly drawn by passing the C” codeword through the memoryless channel Y\Qu\v)’ indexed as 
u'^{Ma,Mb,K). 



Fig. 5. Auxiliary Idealized Distribution T: Codewords , V"^) drawn randomly from a superposition codebook are passed through the memoryless channel 

Qxyz\uv- 


Note that the Markov chains X — {U,V) — Y and [X, Y,U) — V — Z imply that Qxyz\uv = Qx\uvQy\uvQz\v, so the 
memoryless channel decouples, yielding Markov chains AT" — If^V^ — F" and X"y"{7" — — F" under the idealized 

distribution T. Precisely, we have 


= 2n(Ro+Ri) n QxYZ\uv{xt,yt,zt\ut{ma,mb,k),vt{^i^,k)) j . (43) 

mi m 2 / 

Note that both Markov chains X" — MiK — M 2 F” and X"F"Mi — M 2 K — F” are satisfied, consistent with the physical 
constraints of cascade communication. Finally, we set the operational distribution (our synthesis scheme) to be defined by 

■ (44) 

The first node picks messages according to Txi^,Mb\X", k, the second node passes u^{Ma, Mb, K) through the memoryless 
channel Y[Qy\uv ™tl the final node passes v'^{Mb,K) through the memoryless channel fJQziv- 


D. Total Variation Analysis 

We now proceed to show that there exist codebooks such that the above construction ( |44l i meets the secure synthesis criterion 
0 - First, note that by the triangle inequality and (|3^, we have (expectation is over the ((/", F”) codebook) 


E 


Px"y"Z"MiM2 — PmiM2 I I Qxyz\\ < E||Px "y"Z"MiM2 — Tx"Y"Z"MiM2 IItv- + 

A A 

Qxyz 


Txr 


2" Ml M2 


- T 


Ml M 2 


TmiM2 Qxyz — PmiM2 Qxyz 
= E||Px"y"Z"MiM2 — Tx"Y"Z"MiM2 IIti, + 
Tx"y"Z"MiM2 — TmiM2 


+ 




TmiM2 - TmiM2|Itv 


< 2E||Px"Y"Z"MiM 2 — Tx"y"Z>' 


M 1 M 2 I 


E 


Lx"y"Z"MiM2 


- T 


Ml M2 


Qxyz 


(45) 

(46) 

(47) 


Thus, we have reduced the problem to showing that 

• the idealized distribution T satisfies Q, and 

• the operational distribution P is well-approximated by T. 

































We address the former first. Note that with fixed communication {ma,mi,), the scheme may choose from VF" 

codewords, where W = {U,V). By Lemma Rq > I{X,Y, Z\W) = I{X,Y, Z-,U,V) implies that for any {ma,mi,) G 
X we have 


E 




,M2=m6 - 




with lim„_>.oo En = 0. This implies that 


E 


Tx"y"Z"MiM2 ~ TjvfiMa 


Qxyz 


,1 


E 


= E 

TV 2 

x'"-y'"' z'""mim 2 

= 1 E- V 

Oni?i O 


- X^Y^Z^MiM2 


2^1 i?i 


Qxyz 


2nRi 2 

^ 2^ E ® 


^ X^Y^Z^\MiM2 


— Qxyz 


171x1712 


T 


X"y"Z"|MiM2 


— Qxyz 


1 

2nRi 


^ — Hi ^ ^ ' 
mim2 


— Ett, , 


(48) 

(49) 

(50) 

(51) 

(52) 

(53) 


so indeed, T satisfies 0. 

Finally, we have to ensure that P is well-approximated by T. Consider the first term in (|47]i - by ([34li and (|35]l, we have 


E||Px"y"Z"MiM2 — '^X"Y'^Z"MiM2\\tv — E||^X"r"Z"MiM2if ” TrX"y"Z"MiM2if || tv (54) 

= E||Px"tf - |Itv> (55) 

since Py^Z''MiM 2 \x^k = '^Y'^z^MiXhlx^K by definition ( |44l i. Recall that Px^k = 2“"^“ J([ Qx, by the problem definition. 
Thus, it remains to argue that the idealized distribution T in Fig. generates marginally i.i.d. X" that is independent of K. 

Note that with fixed common randomness k, the scheme may choose from 2”^“^ fF" codewords, where W = {U, V). 
Since the codebook has a superposition structure, by Lemma we have that {Ra := Pi — R 2 and R^ := R 2 ) 


imply that for any k G [2"^“], we have 
with lim„_>.oo (5n = 0. This implies that 


R2>I(X;V), 

Ri>I(X;U,V), 


E 


Tx"|x=fc — Qx 


< 


(56) 

(57) 

(58) 


E|lPx"if 


Tx- 


K\\ 


x'^ ,k 




2nito 


T 


X"|X=fc 


k 

^ 2n.Ro 

k 

= (5n- 


(59) 

(60) 

(61) 

(62) 


Combining ( |47l i, ( [5^ , ( |55] l and (|62[), for n sufficiently large, we have for any e > 0 that 


E 


Px"y"Z"MiM2 — PmiM2 


< E. 


(63) 


Thus, for all n sufficiently large, there must exist a choice of Bv^ and Bi/n^yn codebooks that deterministically achieve the 
above bound, with the rate constraints specified in (|^. 


E. Comment on Achievability 

Our scheme requires local randomization at all nodes, including stochastic encoders - also known as the likelihood encoder 
II 29 I . Please refer to a, Ea for a quantitative treatment of local randomness in channel synthesis. Also, observe that while it 
is intuitive to think of the common randomness as a one-time pad on the messages, we do not need to use such a construction 
in our proof. On the other hand, it may be desirable to have a more direct achievability scheme for channel synthesis with 
explicit constructions. Some attempts have been made in this direction BOl . OTl . 






































V. Converse Proof of Theorem 1 

Let {Rq, Ri, R 2 ) be achievable. Then for e G (0,1/4) there exists a ( 2 "^“, secure channel synthesis code 

with an induced joint distribution Px'^,Y",Z", k,Mi,M 2 that 


Px"V"Z"MiM2 — PmiM2 I I Qxyz < e, 

A A rj.y 

for n sufficiently large. First, we use the triangle inequality and ( [34l i to note that 

|1Px"V"Z"MiM2 — PMiM2Px^Y^Z^\\.rv — 


Px"V"Z"MiM2 
Px^Y^Z" — T 


< 2 

Theorem 17.3.3 of ifT^ coupled with ( |67] l implies that 

+P(MiA 
<ne(log(|T'||:V||Z|)+Pi 
:= ngi{e), 

where gi (e) is defined by the above equality. Note that lim^^o 9i (c) = 0. 


(64) 


PmiM2 I 1 Qxyz + ■ ■ ■ 

A A 

" — PmiM2 I 1 Qxyz 

A A 

(65) 

PmiM2 I 1 Qxyz + ■ • ■ 

A A rj.y 

QxyzW 

( 66 ) 

Wtv 

— PmiM 2 I 1 Qxyz < 2e. 

A A 

(67) 

H(X"Y"Z"MiM2) 

( 68 ) 

i) -eloge 

(69) 

(70) 


A. Entropy Bounds 

We will need the following bounds on entropy in terms of total variation IJl Lemma VI.3]. If the joint distribution of 
Z”) is close in total variation to an i.i.d. distribution as assumed, then we have 

n 

Y, Ip{Xt, Yt,Zt; Z*-i) < ng{e), (71) 

IpiXT,YT,ZT-,T)<ng{e), (72) 


where 


5(e) 


4 


(73) 


Note that lime^o 5 (e) = 0. 

We shall use the random variable T uniformly distributed on [n], as a random time index. We also need the result |[3] Lemma 
VI.2] that for distributions Px^ and Qx", we have 


\\Px^ - QxA^. < \\Px- - QxA\^y (74) 

B. Approximate Rate Region 

We use standard information-theoretic inequalities, the physical constraint X” — (Mi, K) — M 2 and the fact that X” is i.i.d. 
and independent of K to bound the communication rates: 


nRi > H(Mi) 

(75) 

nR2 > H{M2) 

(85) 

> H(MAK) 

(76) 

> H{M2\K) 

( 86 ) 

> /(X";Mi|P) 

(77) 

>IiX^;M2\K) 

(87) 

= /(X";Mi,M2|iT) 

(78) 

= /(X”;M2,P) 

( 88 ) 

= I(X";Mi,M2,K) 

n 

(79) 

n 

= YHX^;M2,K\X^-A 

(89) 

= Yi{Xy,m^,M2,k\x^-A 

(80) 

n 


n 


>^/(X,;M2,iT) 

(90) 

= YHX^;MuM2,K,X^-A 

(81) 

i=l 

= nI{XT]M2,K\T) 

(91) 

n 

>^/(2f,;Mi,M2,iT) 

_ 1 

(82) 

= M(Xt;M2,P,T). 

(92) 

1 — i 

= nI{XT;Mi,M2,K\T) 

(83) 



= n/(XT;Mi,M2,iT,r), 

(84) 


























Finally, we bound Rq: 


nRo > H{K) 

(93) 

> H{K\Mi,M 2 ) 

(94) 

> I{X'^,Y^,Z^-,K\Mi,M 2 ) 

(95) 

> I{X^, y”, Z"; Ml, M2, iT) - npi(e) 

(96) 

>^I{Xt,Yt,Zf,Mi,M2,K)- ngi{e) - ng{e) 

+_1 

(97) 

L — T 

>nI{XT,YT,ZT;Mi,M2,K\T)-n{gi{e)+g{e)) 

(98) 

> nI{XT, Yt,Zt-,Mi,M2, K, T) - n{gi{e) + 2 g{e)). 

(99) 


The inequality ( |96] l follows from ( |70l l, while the other steps follow from ( |7T| l and ( |7^ . Making associations Zt) = 

{X,Y,Z), U = Ml and V = {M 2 , K,T), we see that the rates and Markov chains in Q and (|^ are satisfied up to the 
correction in 

Using the Caratheodory theorem for connected sets 1^ . we can show that |V| < |A’||3^||Z| + 3 suffices to ensure the 
existence of a distribution P'xyzuv preserves H{X\U, V), H{X\V), H{X, Y, Z\U, V), I{X, Y, U; Z\V) and the marginal 
on {X, Y, Z). Note that preserving I{X, Y,U-,Z\V) retains the Markov chain {X, Y,U) — V — Z. The resulting distribution is 
formed by an average over distributions of {X, Y, Z, U) that satisfy I{X, Y,U; Z) =0 and X — U — Y, thus preserving both 
Markov chains in (|^. 

Next, we apply the Caratheodory theorem again to argue that \U\ < |A’||3^||Z||V| + 3 suffices to ensure the existence of 
a distribution Txyzuv that preserves H{X\U,V), H{X,Y, Z\U,V), H{Z\X,Y,U,V), I{X;Y\U,V) and the marginal on 
{X,Y, Z,V). Preserving H{Z\X,Y,U,V) and the marginal on {Z,V) retains the Markov chain {X,Y,U) — V — Z. The 
resulting distribution is formed by an average over distributions of {X, Y,Z,V) that satisfy X — V — Y, thus preserving the 
Markov chain X — {U,V) — Y as well. 

Note, that we can redefine U := {U,V) without loss of generality, so have the functional dependence H{V\U) = 0. Using 
(|74ll we only have that 


^XYZ - QxQyZ\X\\^^ = 
< 


PxtYtZt - QxQYZ\X\\^y 
Px^Y^Z^ — Y]_Q^QyZIX 


< e. 


( 100 ) 

( 101 ) 

( 102 ) 


So far we have shown that the rates for any feasible scheme lie in 


where 


{Rq, Ri, R 2 ) 


■ ^Px,Y,Z,U,V S 79e S.t. 


i?l 

> I{X-,U,V) 


i?2 

> I{X-,V) 


Rq 

> I{X,Y,Z-U,V)-.. 



{gi{e) + 2g{e)) 

Px,Y,Z,U,V 

■ \\PXYZ - QxQYZ\x\\„y < 


X - 

-{U,V)-Y, 


(X, 

Y,U)-V-Z, 


|V| 

< |A'||3^||Z| + 3, 


\u\ 

<|A’||3;||Z||V|+2 






(103) 


(104) 


C. Continuity of Sd^ at e = 0 

The final step crucially depends on the compactness endowed by the above cardinality bounds. We would like to show that 
taking the limit e i 0 recovers Sd i-c. 

(105) 

e>0 

First, note that 

f]SD^.e2SD (106) 

e>0 


since Sd^,^ shrinks as e shrinks, and Sdo,o = Sd- 














For the other direction, consider the stricter set 


^ — 


{Rl) e K3 


^Px,Y,Z,U,V G De S.t. 

i?i 

> 

/(X; U, V) 

i?2 

> 

I{X-V) 

i?o 

> 

I{X,Y,Z;U,V) 


Note that 


Pi SD,,e C Closure [ P • 


(107) 


(108) 


e>0 


\e>0 


Suppose otherwise; there exists a rate triple (a, b, c) in the left-hand side (LHS), but not the right-hand side (RHS). Then 
let’s say a* is the smallest value of i?o such that {a*,b,c) is in the RHS, so a* > a necessarily. Since a* is the smallest value 
that guarantees inclusion in the RHS, we can pick e small enough to exclude ((a + a*)/2, b, c) from 5^ Also, we can pick 
e small enough to ensure that gi(e) + 2g(e) < (a* — a)/2. Both conditions together imply that for e small enough, we have 


(a-fa*)/2< min I{X,Y, Z]U,V) =: Rq 

Px,Y,Z,U,V^De 

^ a< R*o + {a-a*)/2 

< K- (ffi(e) + 2g(e)), 


i.e. (a, b, c) is not in the LHS, which contradicts our initial assumption. Thus, 
Next, consider the mapping 


holds. 


fiPxYZUv) = WX, Y, Z; L, L), /(X; U, L), /(X; V)), 
which yields the frontier of Pareto-optimal rates in the guise of f{D) and f{D^). Note that 

P f(D,) = f{D), 


(109) 

( 110 ) 
( 111 ) 


( 112 ) 

(113) 


e>0 


because ne>o ~ decreasing subsets (as e decreases) of the compact probability simplex, and / is a 

continuous function. Thus, we have 

f]S'o^^,=SD. (114) 


e>0 


Finally, note that closed since / is continuous and D is compact. This completes the converse proof 


VI. Summary 

Coordination in general networks remains a challenging area of study, as far as tight results are concerned. In this work, we 
provide a tight result for strong coordination - generating correlated random variables - on a cascade network. Even though 
the result extends to arbitrarily long networks, it is made tractable by a secrecy constraint that ensures that the synthesized 
sequences are independent of the communication transmissions. We also demonstrated how requiring secrecy on a subset of the 
communication links expands the optimal rate region. On the other hand, lack of a centralized source of common randomness 
was shown to shrink the optimal rate region and, in particular, the optimal communication rate region. The questions of 
whether Conjecture 1 of El is true with a central source of common randomness, and whether networks can contribute to our 
understanding of the relationship between empirical coordination and strong coordination, form an exciting direction for future 
study. 
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Appendix 

1) Proof for Arbitrarily Long Cascades (II-F): When we have m > 3 nodes in the cascade, the proof follows the same steps 
outlined in sections IV and V. 

Achievability: The idealized distribution T used for the achievability proof is constmcted as follows. For n > 1, 
let be uniformly distributed on [2”^«] x «.)] x ••• x [2"'^™-!]. We shall set 

Ml := (M{,... := ■ ■ ■ ,Mf^_i) and so on till Mm-i '■= M!^_i. Next, we use a superposition codebook 

with (m — 1) layers. 

Consider a codebook Bu^ of Ui^_i sequences (randomly drawn according to the i.i.d. distribution Q[/„_i), 

where the codewords are indexed as u^_i{Mf^_i, K). For every U^_i codeword, we have a codebook Sj/n ^ 


Qf 2 n(Rm- 2 -Rm-i) {7^_2 sequences (randomly drawn by passing the U^_i codeword through the memoryless channel 
n indexed as itT). The following layers of the codebook are built in a recursive fashion, 

conditioned on all codewords that have been selected in lower layers. The operational distribution P is defined in analogous 
fashion to (|44li, with 




(115) 


The total variation analysis, to show the existence of SCCS codes, follows section IV-D. The analysis of the communication 
rates requires the general version of Lemma found in 0. 

Converse: The steps here parallel those in section V-B. Consider a secure channel synthesis code that satisfies 


PxY-m-lMrr^-^ — Pj^r^-1 1 [ QxY^-^ < 

TV 

(116) 

for e G (0,1/4) and n sufficiently large. For the ith communication rate (1 < i < m — 1), we have 


nRi > H(Mi) 

(117) 

> H{M,\K) 

(118) 


(119) 


( 120 ) 


( 121 ) 


( 122 ) 

n 

(123) 

n 

>Y,nx^;Mr-\K) 

(124) 

= nI{XT;Mf^-\K\T) 

(125) 

= nI{XT;Mr-\K,T). 

(126) 

For the common randomness rate, we have 


nRo > H{K) > H(K\M^-^) 

(127) 


(128) 

> /(X", Yf,- n/i(e) 

(129) 

> ^ I(Xt, {Y^)t,M'^-\K) - nh(e) - nh(e) 

(130) 

> nI{XT, (Li)t, ..., M^-\K\T) - n(/i(e) + / 2 (e)) 

(131) 

> nliXr, (Fi)t, • ■ •, {Y^-i)t; M^-\K,T) - n(/i(e) + 2 / 2 (e)), 

(132) 

where the approximate inequalities follow from (|ll 6 |l, with lime^o/i(e) = liin 4 o/ 2 (e) = 0. The cardinality bound ([TSll is 
derived by using the Caratheodory theorem, as demonstrated in section V-B. Finally, the converse proof is completed by 
letting e 4 , 0 and invoking compactness of the set of distributions that define the optimal rate-region, as done in section V-C. 


2) Proof for Theorem 2 (No Secrecy on Link 1): 

Achievability: We rely on the same idealized distribution T used in the proof of Theorem 1, as depicted in Fig. The 
operational distribution P is defined in (|44|. Note that by the triangle inequality and (|3^, we have (expectation is over the 








codebook) 

E 


Px^y^z^M2 — Pm2 I I Qxyz^ < EjjPjc rtY^Z^M2 — X^Y^Z^M2 \\tv + 

A A 

Tjf"F"Z"M2 — QxYZ 

QxYZ — Pm2 QxYZ 


= E||Px"r"Z"M2 — Tx"Y"Z’^ 


M2I 


Tx^Y^Z^M2 — QxYZ 


T Mo — P] 


Mo 


< 2E||Px’*Y"Z"M2 — Tx"Y"Z"M2IIti^ + ' 
e||Tx"Y"Z"M 2 — ^Ma QxYZ 

Again, we have reduced the problem to showing that 

• the idealized distribution T satisfies ( [T9] l, and 

• the operational distribution P is well-approximated by T. 

The latter can be shown in identical fashion to the steps in section IV-D, after observing that 


(133) 

(134) 

(135) 


(136) 


E||Px"Y"Z"M2 — Tx’*f"Z’*m2||tv ^ E||Px"tf — P^x^kW^v- 

The resulting communication rate requirements are 

R2>I{X-V), 

Ri > I{X-,U,V). 

It remains to bound the second term in ( |135| ). Note that with fixed mj,, the scheme may choose from pp" 

codewords, where W = {U, V). Since the codebook has a superposition structure, by arguments similar to those made in the 
analysis of the communication rates, we have for any mb that 


(137) 

(138) 


E Tf'x"Y"Z’*|M2=mt — I I QxYZ < 

with lim„_>.oo £« = 0, when 

Ro> IiX,Y,Z;V), 

{Ri - R 2 ) + Ro> I{X, Y, Z; U, V) 

=>Ri+Rq> I{X, Y, Z; U, V) + I{X; V), 

where the final inequality follows from ( |137[ l . This implies that 

Tx’*F"Z’*M 2 — T^Ma QxYZ “ 
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Qxyz 
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27iRi 
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— 1 

so indeed, T satisfies @- 

Converse: The steps here parallel those in section V-B. Consider a secure channel synthesis code that satisfies 


Px^Y”-z^M2 — Pm2 Qxyz 


< e, 


(139) 

(140) 

(141) 

(142) 

(143) 

(144) 

(145) 

(146) 

(147) 

(148) 


for e G (0,1/4) and n sufficiently large. Using the steps in section IV-B, we bound the communication rates as 


Ri>I{Xt-,Mi,M2,K,T), 

R2>I{Xt;M2,K,T). 


(149) 

(150) 










































For the common randomness rate, we have 

nRo > H{K) > H{K\M2) 

> I{X^,Y^,Z^]K\M2) 

n 

> I{Xt,Yt, Zt; M 2 , K) - ngi{e) - ng{e) 

> nliXr, Yt, Zt-, M 2 ,K\T) - n{gi{e) + g{e)) 

> nI{XT, Yt, Zt-, M 2 ,K, T) - n{gi{e) + 2g{e)), 


(151) 

(152) 

(153) 

(154) 

(155) 

(156) 


where the approximate inequalities follow from ( |148| l, with lime^o5(e) = limeio5i(e) = 0- Similarly, we have the sum rate 
constraint 


n{Ri+Ro) > H{Mi,K) 

> H{Mi,K\M2) 

> I{X^,Y^,Z'^-,Mi,K\M2) 
>I{X^,Y^,Z^-,Mi,M2,K)-ng,{e) 

n 

> HXt, Yt,Zt-,Mi,M2,K)- npi(e) - ng{e) 


(157) 

(158) 

(159) 

(160) 

(161) 


> nI{XT,YT,ZT-,Mi,M2,K\T)-n{gi{e)+g{e)) (162) 

> nI{XT, Yt, Zt-, M 2 , K, T) - n{g^{t) + 2p(e)). (163) 

Setting X = Xt,Y = Yt,Z = Zt,U = (Mi) and V = {M 2 , K,T) results in the rate expressions provided in (|2T| with 
the Markov chains in (|^. The cardinalities of U and V are bounded by using the Caratheodory theorem, as demonstrated in 
section V-B. Finally, the converse proof is completed by letting e 0 and invoking compactness of the set of distributions that 
define the optimal rate-region, as done in section V-C. 


Task Assignment Example: Since 

I{X, Y, Z-, U, V) + I{X-, V) = I{X, Y, Z-, V) + I{X, Y, Z; U\V) + I{X-, V) 

= I{X, Y, Z; V) + I{x-, U, y) + I{Y, Z; U\V, X) 
= I{X, Y, Z; V) + I{X-, U, y) + I{Y-, U\V, X), 


(164) 

(165) 

(166) 


the result is tight for any distribution which can be efficiently synthesized under the constraint I{Y ; 17|y, X) = 0. The sum-rate 
constraints in ( |20l i and ( [2T| are redundant in this case. 

When {X, Y, Z) is a random permutation of {1,2,3}, then {X, Y,U) — V — Z ^ H{Z\V) = 0, since Z = {1,2, 3}\ {X, y}. 
We also have that Y = {1, 2, 3} \ {X, Z}, so 

I{Y-,U\V,X)=I{Y-,U\V,Z,X) (167) 

= 0, (168) 

since y is a function of {X, Z). Thus, the sum-rate constraints in (|20li and (|2l]i become redundant, and (|20li reduces to (|2l]i. 


3) Proof for Theorem 3 (No Secrecy on Link 2): 
Achievability: Using (|3^, we have 


Rx’^y^Z’^Mi — Pmi Qxyz < Px^y^z^MiM2 — PmiM2 Qxy 


(169) 


so the achievability part readily follows from the achievability proof of Theorem 1. Note that is the region Sd' in 

Theorem 1. 

Converse: The steps here parallel those in section V-B. Consider a secure channel synthesis code that satisfies 


Px^Y^z^Mi — Pmi Qxyz 


< e, 


TV 


(170) 
















for e e (0,1/4) and n sufficiently large. Using the steps in section IV-B, we bound the communication rates as 


Ri>I{Xt;Mi,K,T), (171) 

R2>I{Xt;M2,K,T). (172) 

For the common randomness rate, we have 

nRo > H{K) > H{K\Mi) (173) 

> /(X", F”, F”; K\Mi) (174) 

> (175) 

n 

>Y,I{Xt,Yt,Zt-M^,K)- npi{e) - ng{t) (176) 

t^l 

> nI{XT, Yt, Zt] Mi,K\T) - n{gi{e) + g{e)) (177) 

> nI{XT, Yt, Zt; M^,K, T) - n(pi(e) + 2g{e)), (178) 


where the approximate inequalities follow from ( |170| l, with lime^o 5(e) = lime^o 5i(e) = 0. 

Setting X = Xt,Y = Yt,Z = Zt,U = {Mi,K,T) and V = {M 2 ,K,T) results in the rate expressions provided in ( |2^ 
with the Markov chains in (|^. The cardinalities of U and V are bounded by using the Caratheodory theorem, as demonstrated 
in section V-B. Finally, the converse proof is completed by letting e 0 and invoking compactness of the set of distributions 
that define the optimal rate-region, as done in section V-C. 


4) Proof for Theorem 4 (Cascade with Relay): 

Achievability: We stitch together two point-to-point strong coordination schemes in order to achieve ( |28l l. From 0 Theorem 
II. 1], we know that i?i > I{X;U) suffices to achieve 






< e 

TV 


(179) 


for any e > 0 and n sufficiently large. The relay node generates a C/" sequence and then synthesizes a channel to Z" at rate 
i ?2 > 7(F; U), so we have 


Pu^z^ - 


< £ 


(180) 


for any e > 0 and n sufficiently large. Note that we have X” — ( 7 " 
constraints imply that 


Pu’^ - Qc/ 


— Z” under the operational distribution P. Also, the above 
< e (181) 

TV 


for n sufficiently large, by p4| ). Now, it remains to argue that (X", Z'^) is approximately i.i.d. The following steps make a 
statement that is analogous to the Markov lemma 1^ . commonly used to prove results about empirical coordination. We use 
















the triangle inequality, and ([?5]l below, along with the above statements. We have 


Px’^Z’' — Qxz 


for n sufficiently large. 


< 


< 


Pu^Px^\u^Pzr^\u^ “ XI n QuQx\uQz\ 

y^n ytl 

X ij{Quz)Px-\u- - X (n Qc/z) <5x1 

n y-n 

X - X (n Quz'^Px^\u 

yn yn 

\(J[Quz)Px’^\U’^ - (n Qc/z) Qx|c 


\Pu^Z^Px^\U 


)P: 


X"|( 7 " 


{J{Quz) 

X ij{Quz)\Px^\u^-X{ Qx\ 


X"- -li"- ,2'‘ 


\\Pu^z^ - (J[Quz)\\^^ 

< \\{I[Qu)Px^\u^ -YlQxu\\^^+e 


(]^Q(7)-Px"|C/" --Pc/"-Px"|(7’’ 


< 

< 3e, 


(n^c/) 


-Pti 


2e 


(182) 


(183) 


(184) 


(185) 

(186) 

(187) 

(188) 
(189) 


Converse: We assume that 


for e £ (0,1/4) and n sufficiently large. 
Since X" is i.i.d., we have 


P 




YJqxz 


< e 

TV 


ni?i >H{Mi) 

n 

i^l 

n 

= YHX^;M,,KuX^-^) 

i^l 

n 

i^l 

= nI{XT;M,,K,\T) 

= nI{XT;Mi,Ki,T), 


(190) 


(191) 

(192) 

(193) 

(194) 

(195) 

(196) 

(197) 

(198) 


where T is a random time index uniformly distributed on [n]. To bound R 2 , we make use of the fact that K 2 -L {Mi,Ki), 





























since Mi — {X'^,Ki) — K 2 , and K 2 -L Also, we use the constraint that {Mi,Ki) — {M 2 ,K 2 ) — Z'^. Consider 


ni ?2 > H{M2) (199) 

>/(Z";M2|A:2) (200) 

= I{Z^;M2,K2)-IiZ^;K2) (201) 

= I{Z'-^; Ml, Ki, M 2 , K 2 )-I{Z^;K2) (202) 

= I{Z^-,Mi,Ki) + I{Z^-,M2,K2\Mi,Ki) - I{Z^-,K2) (203) 

= /(Z"; Mi,Ki) + I{Z^-, K2\Mi,Ki) + /(Z"; M2\Mi,Ki,K2) - I[Z^-, K 2 ) (204) 

= I{Z^■ Ml, Ki) + I{Z^, Ml, Ki,K2)+I{Z^-,M2\Mi,Ki,K2)-I{Z'^-,K2) (205) 

= /(Z"; Mi,Ki) + I{Mi,Ki-K2\Z'^) + /{Z"; M2\Mi,Ki,K2) (206) 

>I{Z^;Mi,Ki) (207) 

n 

= ^/(Z,;Mi,ifi|Z*-i) (208) 

n 

> ^/(Z,; Ml, iCi)- 52 (e) (209) 

= n/(ZT; Ml, XilT)-52(e) (210) 

>n/(Zr;Mi,Xi,r)- 252 (e), (211) 

where the approximate inequalities follow from (|190|l, with lim^^o 92 (e) = 0. 


Setting X — Xt,Z = Zt,U = {Mi, Ki,T) results in the rate expressions provided in ( |30| ) with 2f — ?7 — Z, up to the 
correction in ( |21 l| l. The cardinality of U is bounded by using the Caratheodory theorem, as demonstrated in section V-B. 
Finally, the converse proof is completed by letting e i 0 and invoking compactness of the set of distributions that define the 
optimal rate-region, as done in section V-C. 
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